NOVEL GENES AND EXPRESSION PRODUCTS THEREFROM 

FIELD OF THE INVENTION 

5 

[0001] The present invention relates generally to the identification of 
the products of gene expression in cancerous tissue or other tissue 
associated with an aberrant medical condition. The identification of such 
expression products enables the development of a range of diagnostic and 
10 therapeutic agents. 

p [0002] In one embodiment, a gene is differentially or preferentially 

* s expressed in cancerous tissue relative to normal tissue. The identification of 

m the expression product of the gene and of the gene itself provides a means of 

2 15 developing diagnostic and therapeutic agents for the treatment, prophylaxis 

2 and diagnosis of the cancerous condition in which the gene is differentially 

u * 

ii or preferentially expressed. In another embodiment, the gene is involved in 

Q 

«i transcriptional control and hence modulating gene expression is 
!*f contemplated as a means of modulating cell regulation, 
b 20 

BACKGROUND OF THE INVENTION 

[0003] The increasing sophistication of recombinant DNA techniques is 
greatly facilitating research and development in the medical and allied health 
25 fields. This is particularly the case as the human genome sequencing project 
nears completion. However, in addition to elucidating the nucleotide 
sequence of the human genome, there is a requirement to undertake 
functional analyses of particular nucleotide sequences, especially those 
forming transcription units, i.e. genes. 

30 
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[0004] A functional analysis involves the determination of expression 
patterns. For example, some genes may be expressed preferentially or 
exclusively during particular disease conditions such as cancer or 
autoimmune conditions. The identification of such genes provides a basis for 
5 developing a range of diagnostic and therapeutic agents aimed, for example, 
at identifying expression of the gene and/ or developing protocols for down- 
regulating expression of the gene. 

[0005] In work leading up to the present invention, the inventors 
10 sought to identify genes differentially or preferentially expressed in human 
hepatocellular carcinoma. This is one of the most frequently encountered 
malignancies affecting Asia and China (Schafer and Sorrell, 1999). 

[0006] SUMMARY OF THE INVENTION 
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[0007] Throughout this specification, unless the context requires 
otherwise, the word "comprise", or variations such as "comprises" or 
"comprising", will be understood to imply the inclusion of a stated element or 
integer or group of elements or integers but not the exclusion of any other 
20 element or integer or group of elements or integers. 

[0008] A novel protein, HCC-1, is identified from the HCC-M cell line 
througR^SD^gel electrophoresis and mass spectrometry analysis of the cell 
proteome. The assemBtedJ^ST sequence of the novel protein is confirmed by 
25 a peptide mass fingerprinting an3^RAC^The coding region of Hcc- 1 cDNA 
has 630 bases, which code for the 210 amiruTScki^of the full-length protein. 
The unique DNA sequence at the 3' untranslated region^^SJ^bp) has been 
used to localize the gene to chromosome 7q22.1. A total of 690 b^at4he 5' 
untranslated region of Hcc- 1 has been identified and promoter activity haJ 
30 been demonstrated at this region. A number of uORFs, which is a common! 
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feature in proto-oncogenqs and growth factors, are noted at the 5' 
untranslated region. 

[0009] The protein HCC-l\s localized to the nucleus region of two liver 
cell lines by immunofluorescence\taining. Bioinformatics predictions show 
that the first 42 amino acids of the $>rotein have identity matches to 
heterogenous nuclear ribonucleoprotedns from various vertebrate species 
including human. The domain is also aVutative bi-helical DNA-binding 
motif. The rest of the hcc-1 amino acid sequence has no known homology in 
10 vertebrates. 

[0010] The cDNA of the hcc-1 is detected in tissue from various human 
organs. However, a marked increase in hcc- 1 cDNA level is observed in 
pancreatic adenocarcinoma. An increase in hcc- 1 cDNA level is also observed 
15 in well-differentiated hepatocellular carcinoma and its level decreases as the 
carcinoma progressed to a poorly differentiated stage. The increase in hcc-1 
levels in both types of tumor are expected due to the same developmental 
origin of the two organs. 



O 20 [0011] HCC-1 is proposed to be involved in nucleic acid binding and 
transcriptional control, and hence is involved in cell regulation. The protein 
and corresponding genetic sequence has therapeutic and diagnostic 
applications. \^ 

25 [0012] One aspect of the present invention is directed to an isolated 
nucleic acid molecule comprising a sequence of nucleotides, the expression 
of which, is differential or preferential in human hepatocellular carcinoma 
tissue or tissue from a related cancer relative to other tissue in said subject 
and/or in subjects not diagnosed with this condition. 

30 
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[0013] Another aspect of the present invention provides an isolated 
peptide, polypeptide or protein or a derivative, homologue or analogue 
thereof which protein is differentially or preferentially produced in or by 
human hepatocellular carcinoma tissue or tissue from a related cancer 
5 relative to other tissue in said subject and/or in subjects not diagnosed with 
this condition. 
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[0014] Yet another aspect of the present invention is directed to a 
modulator of expression of a nucleic acid molecule which nucleic acid 
10 molecule is differentially or preferentially expressed in human hepatocellular 
carcinoma tissue or tissue from a related cancer relative to other tissue in 
said subject and/or in subjects not diagnosed with this condition. 

[0015] Still another aspect of the present invention is directed to the 
15 use of a nucleic acid molecule, the expression of which is differential or 
preferential in human hepatocellular carcinoma tissue or tissue from a 
related cancer relative to other tissue in said subject an/or in subjects not 
diagnosed with having this condition in the manufacture of a medicament 
for the treatment of hepatocellular carcinoma or a related condition. 
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[0016] Another aspect of the present invention contemplates a method 
for diagnosing human hepatocellular carcinoma or a related condition in a 
subject or a propensity for said subject to develop human hepatocellular 
carcinoma or a related condition, said method comprising identifying 
25 expression of a gene which is differentially or preferentially expressed in 
tissue from subjects with hepatocellular carcinoma or a related condition 
relative to other tissue in said subject and/ or subjects not diagnosed with 
this condition. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0017] Figure 1 is a representation of the nucleotide sequence of hcc-1. 

5 [0018] Figure 2 is a representation of the amino acid sequence of HCC- 
1; underlined sequences are amino acid sequences obtained by MS/ MS 
analysis. 

[0019] Figure 3 is a representation of the nucleotide sequence of hcc-1 
10 following amplification through long distance polymerase chain reaction 
(PCR) and used to construct an expression vector (873 bp). 

:^ [0020] Figure 4 is a photographic representation showing PCR 

IB amplification of hcc-1 cDNA in normal and tumor liver tissues. M: DNA size 

% 15 marker; 1, Tumor tissue; 2, Normal tissue; 3, Negative control. 

^ [0021] Figure 5 is a representation of the untranslated region of hcc-1. 

i s 

nj Underlined sequences are the minicstrones or uORFs before the start of the 
P151 coding region with the start and stop codons in bold. 

p 20 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

[0022] The present invention is predicated in part on the identification 
of gene expression products substantially present in or produced by tissue 
25 in subjects diagnosed with hepatocellular carcinoma or a related condition 
but substantially absent or in a substantially reduced amount in other 
tissues in the subject or in subjects not diagnosed with this condition. 

[0023] Accordingly, one aspect of the present invention is directed to an 
30 isolated nucleic acid molecule comprising a sequence of nucleotides, the 
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expression of which, is differential or preferential in human hepatocellular 
carcinoma tissue or tissue from a related condition relative other tissue in 
said subject and/ or in to subjects not diagnosed with this condition. 



5 [0024] Reference herein to an "expression product" includes reference 
to mRNA transcribed from a nucleotide sequence of a gene and/or an amino 
acid sequence, generally in the form of a peptide, polypeptide or protein, 
translated from the mRNA molecule. Expression products may be identified 
directly or indirectly such as via a complex (e.g. tRNA-amino acid complex) 
10 or via an effect. Terms such as "expression" or "expressed" means the 
expression of a gene sequence to produce an expression product. 

[0025] The term "gene* is used in its broadest sense and includes cDNA 
m corresponding to the exons of a gene. Accordingly, reference herein to a 
15 "gene" is to be taken to include: a classical genomic gene consisting of 
transcriptional and/or translational regulatory sequences and/or a coding 
region and/ or non-translated sequences (i.e. introns, 5'- and 3'- 
untranslated sequences); or (ii) mRNA or cDNA corresponding to the coding 
regions (i.e. exons) and 5'- and 3'- untranslated sequences of the gene. 



m 
m 
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[0026] The term "gene" is also used to describe synthetic or fusion 
molecules encoding all or part of an expression product. In particular 
embodiments, the term "nucleic acid molecule" and "gene" may be used 
interchangeably. 



[0027] The ter^^ <1 dif^erential" or a related term such as "differentially" 
in relation to gene expressionrfteaq^that a gene sequence is expressed in 
one type of cell or tissue (e.g. cancerous^ttM3q^tissue) but is substantially 
not expressed in another cell or tissue. The term "prfefecential" or a related 
30 term such as □ preferentially □ in relation to gene expression^rn^ans that a 
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gene sequence is expressed atV higher level in one type of cell or tissue (e.g. 




cancerous cell or tissue) relativ&to another type of cell or tissue. The 
difference in expression levels may, for example, be from two-fold to 100-fold 
or from three-fold to 50-fold. In one embodiment, the gene is liver tissue of 



5 patients within hepatocellular carcjnoma and is substantially not expressed 
in the normal liver. 



[0028] Reference herein to a "subject" generally means a human 
subject although the present invention extends to other mammals which are 
10 capable of developing a homologous condition to human hepatocellular 
carcinoma. Such other mammals include livestock animals, laboratory test 
P animals and companion animals. 

"Eg 

TO [0029] The disease condition "hepatocellular carcinoma'' also includes 

IS 

E 15 conditions related to hepatocellular carcinoma such as at the genetic, 
^ immunological, biochemistry, physiological, or aetiological levels. The terms 
"carcinoma", "sarcoma" and "tumor" may be used interchangeably. 

ru 

yi [0030] The term "isolated" in relation to a nucleic acid molecule or an 
Q 20 expression product such as mRNA or a peptide, polypeptide or protein 

means that the nucleic acid molecule or expression product has undergone 
at least one purification step away from background material. Such a 
purification step includes gel electrophoresis, centrifugation, precipitation, 
chromatography such as HPLC or mass spectrometry such as MALDI-TOF 
25 MS. 



[0031] A "nucleic acid molecule" may be RNA (e.g. mRNA) or DNA (e.g. 
genomic DNA or cDNA) or an RNA/ DNA hybrid. A nucleic acid molecule may 
also be a gene as defined above. In one embodiment, the nucleic acid is in a 
30 vector such as an expression vector. In other embodiments, the nucleic acid 
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is in single or double stranded, linear or covalently closed circular form. The 
present invention further extends to primers, probes, sense and antisense 
molecules, and ribozymes to be subject nucleic acid molecule. 



5 [0032] The present invention further extends to the promoter region of 
the gene or functional variants of the promoter. The promoter may also be 
targeted in a therapeutic programme to modulate expression of the gene. 
Furthermore, the present invention extends to regulatory regions of the hcc- 
1 including 3* and 5' untranslated regions of the gene. Such regions may be 

10 used to genetically modulate expression of the gene. 

0 [0033] In a particularly preferred embodiment, the promoter region of 

^ the hcc-1 is defined by the nucleotide sequence set forth in SEQ ID NO:4. 

||j The present invention extends to nucleotide sequence having at least 60% 

B p 15 similarity to the nucleotide sequence set forth in SEQ ID NO:4 as well as a 

^* nucleotide sequence capable of hybridizing to the nucleotide sequence set 

^ forth in SEQ ID NO: 4 or its complementary form. 

m 

5^ [0034] The present invention further extends to expression products in 
□ 20 isolated form. Preferably, the expression product is in the form of a peptide, 
polypeptide, or protein. 

[0035] Another aspect of the present invention provides an isolated 
peptide, polypeptide, or protein, or a derivative, homologue, or analogue 
25 thereof, which protein is differentially or preferentially produced in or by 
human hepatocellular carcinoma tissue or tissue from a related cancer 
relative to other tissue in said subject and/ or in subjects not diagnosed with 
this condition. 

30 [0036] A "derivative" includes a single or multiple amino acid 
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substitution, addition and/or deletion to the amino acid sequence normally 
associated with the peptide, polypeptide, or protein. Accordingly, a 
"derivative" includes a part, portion, or fragment of the peptide, polypeptide, 
or protein. 

5 

[0037] Conveniently, the part, portion, or fragment of the peptide, 
polypeptide, or protein contains antigenic determinants such that the part, 
portion or fragment is capable of interacting with antibodies to the 
expression product or to immune cells (e.g. T cells) sensitized to the 
10 expression product. A derivative also includes polymorphic variants or 

glycosylation variants as well as any alterations to molecules associated with 
the expression product such as lipids, carbohydrates, DNA or RNA, or other 
proteins. 



=p 15 [0038] Amino acid insertional derivatives of the peptide, polypeptide, or 
ifi protein of this aspect of the present invention include amino and/or carboxyl 
L terminal fusions as well as intra- sequence insertions of single or multiple 
\y amino acids. Insertional amino acid sequence variants are those in which 
\1 one or more amino acid residues are introduced into a predetermined site in 
j~f 20 the molecule although random insertion is also possible with suitable 

screening of the resulting product. Deletional variants are characterized by 
the removal of one or more amino acids from the sequence. Substitutional 
amino acid variants are those in which at least one residue in the sequence 
has been removed and a different residue inserted in its place. 

25 

[0039] Where the peptide, polypeptide or protein is derivatized by 
amino acid substitution, the amino acids are generally replaced by other 
amino acids having like properties, such as hydrophobicity, hydrophilicity, 
electronegativity, bulky side chains, and the like. Amino acid substitutions 
30 are typically of single residues. Amino acid insertions will usually be in the 
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order of about 1-10 amino acid residues and deletions will range from about 
1-20 residues. Preferably, deletions or insertions are made in adjacent pairs, 
i.e. a deletion of two residues or insertion of two residues. 



10 



[0040] Analogues including mimetics include molecules which contain 
non-naturally occurring amino acids as well as molecules which do not 
contain amino acids but nevertheless behave functionally the same as the 
peptide, polypeptide or protein. Analogues of the subject molecules 
contemplated herein include modifications to side chains, incorporation of 
unnatural amino acids and/or their derivatives during peptide synthesis and 
the use of crosslinkers and other methods which impose conformational 
constraints on the peptide molecule or their analogues. 



W [0041] Examples of incorporating unnatural amino acids and 
£ 15 derivatives during peptide synthesis include, but are not limited to, use of 
J norleucine, 4-amino butyric acid, 4-amino-3-hydrox7-5-phenylpentanoic 
acid, 6-aminohexanoic acid, t-butylglycine, norvaline, phenylglycine, 
ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl 
alanine and/or D-isomers of amino acids. A list of potential non-natural 
p 20 amino acids contemplated herein is shown in Table 1 . 

TABLE 1 



O 
III 



25 



30 



35 



Non-conventional 
amino acid 



Code Non-conventional 
amino acid 



Code 



a-aminobutyric acid 

a-amino-a-methylbutyrate 

aminocyclopropane- 

carboxylate 

aminoisobutyric acid 

aminonorbornyl- 

carboxylate 

cycloh xylalanine 

cyclop ntylalanine 

D-alanine 

D-arginine 

D-aspartic acid 

D-cystein 



Abu 

Mgabu 

Cpro 

Aib 
Norb 



Cp n 

Dal 

Darg 

Dasp 

Dcys 



L-N-methylalanine 

L-N-methylarginine 

L-N-methylasparagine 

L-N-methylaspartic acid 

L- N- m e thylcy steine 

L-N-methylgLutamine 

L-N-methylglutamic acid 

Chexa L-N-methylhistidine 

L-N-methylis lleucine 

L- N-m ethylleu cine 

L-N-methyllysine 

L- N-m ethylm ethionin 

L-N-methylnorleucine 



Nmala 

Nmarg 

Nmasn 

Nmasp 

Nmcys 

Nmgln 

Nmglu 

Nmhis 

Nmile 

Nmleu 

Nmlys 

Nmmet 

Nmnle 



- 10- 







D-glutamin 


Dgln 


L-N-m ethylnorvaline 


Nmnva 






D- glutamic acid 


Dglu 


L- N-m ethylornithine 


Nmom 






D-nistiaine 


unis 


L-N-m thylph nyl alanine 


Nmph 






D-isoleucine 


Lille 


L-N-m thylproline 


Nmpro 




c 

J 


D-leucine 


T"\i . 

Dleu 


L- N-m ethylserine 


Nmser 






D- lysine 


r\i«Tf« 
uiys 


L-N-m ethylthreonine 


iNmtnr 






D-m ethionine 


Dm et 


L- N-m ethyltryptophan 


iNmtrp 






D- ornithine 


Dorn 


L-N-m ethyl tyro sin e 


Nmt3T 






D- phenylalanine 


upne 


i>- in -m etnyivaiine 


Nmval 






D-proline 


Dpro 


u- in -m etnyietnyigiycine 


IVJ AT* AT 

iNmetg 






D- serine 


Dser 


L- N-m ethyl- t-bu tylglycine 


Nmtbug 






D-threonine 


utnr 


L- n o rleu cm e 


iNie 






D- tryptophan 


utrp 


L-norvaline 


Nva 






D-tyrosine 


uvyt 


a-m ethyl-am inoisobutyr ate 


Maib 




i ^ 


D- valine 


L/vai 


a - m e tny i - y- am in o d u Tyraie 


\4 Afn r\i 1 

xvigauu 






D - o - m ethyl alan in e 


Dmala 


a-m etny icy cionexyiaianine 


ivicnexa 






D-a-methylarginine 


Dmarg 


a - m e thy 1 cylco p en tyl al an in e 


K n a«^ 

Mcpen 






D-a-methylasparagine 


Dmasn 


a-m ethyl-a-napthylalanine 


ivianap 






D-a-m ethylaspartate 


Dm asp 


a-m ethy lpenicillam in e 


Mpen 






D- a-m ethylcysteine 


Dmcys 


in - am ino du tyi j giy cin e 


INglU 






D-a-methylglutamine 


urn gin 


r\-[z- am ino etnyi j giy cin e 


Naeg 


o 




D-a-methylhistidine 


umnis 


N - (3 - am in o pro py 1) gly cin e 


Norn 






D-a-m ethylisoleu cine 


Dm lie 


N-amino-a-methylbutyrate 


Nmaabu 






D-a-methylleucine 


Dm leu 


a-napthylalanine 


Anap 


m 




D-a-methyllysine 


Dmlys 


N-benzylglycine 


iNpne 






D-a-methylm ethionine 


Ummet 


N-(2-carbamylethyl)glycine 


in gin 






D- a-m ethylornithine 


Dm orn 


N- (carbarn ylm ethyl) glycine 


Nasn 






D-a-methylphenylalanine 


umpne 


N - ( 2 - carb oxy e thy 1) gly cin e 


INglU 






D-a-methylproline 


Dm pro 


in - ^car Doxym emyijgiycine 


Nasp 




D-a-m ethyl serin e 


Dmser 


in - cy cio d u tyigiy cin e 


IN CDUl 






D- a-m ethylthreonine 


jminr 


in -cycionepiyigiycine 


in cnep 


0 
: y 




u - a - m etny itryp t op n an 


umtrp 


in - cy cion exyigiy cin e 






D-a-m e thyltyro sin e 


Ttm fir 

um ty 


in - cy cio uecyigiy cin c 


1 1 cucc 


ill 




D-a-m e thy 1 valin e 


Dmval 


nT r 1 ^ a ^4 a a a v r 1 a*1 t t/^-i #^ a^ 

in - cyico aoae cyigiycine 


iNcaoa 






D-N-m ethyl alanine 


Dnmala 


in - cy ci o o c cy igiy cin e 


IN COCt 


M 




D- N-m ethylarginine 


Dnmarg 


N- cy cl o pr o pylgly cin e 


in cpro 






D-N-m ethyl asp ar agin e 


Dnmasn 


N - cy cl o u n d e cy lgly cin e 


IVT ay i ^4 

iNcuna 






D- N-m ethylaspartate 


Dnm asp 


in - \£, , ^ - aipn eny le tnyi ) giy cin e 


iNunm 






D-N-m ethylcysteine 


Dnm cys 


in - \o , o - dip n eny ip ro py i j giy cin e 


in one 






D-N-methylglutamine 


Dnmgln 


N- (3 - gu an i din o p ro py 1) gly cin e 


Narg 






D-N-metnylglu tarn ate 


Dnmglu 


N- ( 1 -hydroxy ethyl) gly cin e 


iNtnr 






D-N-m e thy lh i sti din e 


Dnmhis 


in - ^ny aroxye tnyi j j giy cine 


Nser 






D- N-m ethylisoleu cine 


unmiie 


N- (im idazolyl ethyl)) glycine 


iNms 






D-N-methylleucine 


Dnmleu 


in - \o - in u o ly ly e tny l j giy cin e 


in n tip 






D- IN -m etny 1 lysine 


Dnmlys 


N-m ethyl- Y - am in obu tyrate 


iVIw* fvn rill 

iNmgaou 






N-m ethyl cy cloh exylal an in e 


Nmchexa 


L^-iN-mei.nyimeLnionine 


uiiiiim ei 






D-N-m ethylornithine 


unmorn 


in m e tny icy ci open ty icu<uiiii c 


in mcpen 






in -m euiyigiycine 


ixctia 


j-/- in -m ctxiyipncnyicucuiixic 


LJ 1 1 ill LI 1 1 C 






11 llld.ll V1CU11111U13U LJL4 *-Jrl CILC 


1 1 lil CU *J 


D-N-m <*thvlnrolinp 


Dnmnro 




50 


N- ( 1 -m ethylpropyl)glycine 


Nile 


D- N-m ethylserine 


Dnmser 






N-(2-methylpropyl)glycine 


Nleu 


D-N-m thylthreonine 


Dnmthr 






D-N-m thy 1 tryptophan 


Dnmtrp 


N-(l-m thylethyl)glycine 


Nval 






D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 






D- N-m ethylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 




55 


y-aminobutyric acid 


Gabu 


N-(p-hydroxyph nyl)glycine 


Nhtyr 






L-t-butylglycin 


Tbug 


N-(thiom thyl)glycine 


Ncys 
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L- thylglycin 


Etg 


penicillamine 


r n 


L-hom ophenylalanine 


Hpne 


L-a-m ethylalanine 


Mala 


L-a-methylarginine 


Marg 


L-a-m ethylasp aragin e 


Masn 


L-a-methylaspartat 


Masp 


L-a-m ethyl- t-butylgly cine 


lYllDUg 


L- a - m e thy Icy st e in e 


Mcys 


L/-m e tnyietnyigiycine 


Metg 


L- a - m e thy lglu t am in e 


Mgln 


L-a-m ethylglu tarn ate 


Mglu 


L- a - m e thy lhisti dine 


Mills 


L-a-m ethylhom ophenylalanine 


Mhphe 


L-a-methylisoleucine 


Mile 


IN - [z - m e tny i tn 10 e tny l j giy cm e 


[Nmet 


L-a-methylleucine 


Mleu 


L-a-methyllysine 


Mlys 


L-a-methylmethionine Mmet 




L-a-methylnorleucine 


Mnie 


L- a - m e thy In o rvalin e 


Mnva 


L-a-methylornithine 


Morn 


L-a-m e thylph e ny 1 alanine 


Mphe 


i>-a*mecnyiproiine 


Mpro 


L-a-methylserine 


Mser 


L-a-methylthreonine 


Mthr 


L-a-m ethyl tryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 


L-a-m ethyl valine 


Mval 


L-N-m ethylhom ophenylalanine 


Nmhphe 


N-(N-(2,2-diphenylethyl) 


Nnbhm 


N-(N-(3,3-diphenylpropyl) 


Nnbhe 


carbamylm ethyl)glycine 




carbamylm ethyl)glycine 




1-carboxy- l-(2,2-diphenyl- 


Nmbc 






ethylamino)cyclopropane 









[0042] Crosslinkers can be used, for example, to stabilize 3D 
conformations, using homo-bifunctional crosslinkers such as the 
Afunctional imido esters having (CH 2 ) n spacer groups with n=l to n=6, 
glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional 
reagents which usually contain an amino-reactive moiety such as N- 
hydroxysuccinimide and another group specific-reactive moiety. 

[0043] All these types of modifications may be important to stabilize the 
subject expression product. This may be important if used, for example, in 
the manufacture of a vaccine or therapeutic composition or agents for use in 
detection assays. 



[0044] The present invention further contemplates chemical equivalents 
of the subject peptides, polypeptides and proteins. Chemical equivalents may 
35 not necessarily be derived from the subject molecule itself but may share 
certain conformational or functional similarities. Alternatively, chemical 
equivalents may be specifically designed to mimic certain physiochemical 
properties of the molecules. Chemical equivalents may be chemically 
synthesized or may be detected following, for example, natural product 
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screening. Preferably, a chemical equivalent is a functional equivalent. 



[0045] The amino acid variants referred to above may readily be made 
using peptide synthetic techniques well known in the art, such as solid 
5 phase peptide synthesis and the like, or by recombinant DNA manipulations. 
Techniques for making substitution mutations at predetermined sites in 
DNA having known or partially known sequence are well known and include, 
for example, M13 mutagenesis. The manipulation of DNA sequence to 
produce variant proteins which manifest as substitutional, insertional or 
10 deletional variants are conveniently described, for example, in Sambrook et 
al. (1989). 

O 

H [0046] A "homologue* as referred to herein includes an expression 

2 product having a similar structure, function, genetic origin or immunogenic 
,jp 15 profile and which may be present in the same or a different cell type or in a 
,~ different species of mammal. 

pJ [0047] In accordance with the present invention, it is proposed that the 
jil expression of a gene differentially or preferentially in hepatocellular 

3 20 carcinoma or a related condition provides a means for development of a 

range of therapeutic and diagnostic agents. In one particular case, the gene 
is associated with development, maintenance and/ or growth of 
hepatocellular carcinoma or related condition. By targeting the gene, the 
expression of the gene and/or its expression product, it is proposed herein 
25 that this will reduce or inhibit development, growth or maintenance of the 
carcinoma and /or further facilitate another form of treatment conducted 
simultaneously or sequentially with. 

[0048] Yet another aspect of the present invention is directed to a 
30 modulator of expression of a nucleic acid molecule which nucleic acid 
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molecule is differentially or preferentially expressed in human hepatocellular 
carcinoma tissue or tissue from a related cancer relative to other tissue in 
said subject and/ or in subjects not diagnosed with this condition. 

5 [0049] In a related embodiment, there is provided a modulator of an 
expression product of a nucleic acid molecule which nucleic acid molecule is 
preferentially or differentially expressed in human hepatocellular carcinoma 
relative to subjects not diagnosed with this condition. A "modulator" may be 
an antagonist or agonist. In a preferred embodiment, the modulator is an 
10 antagonist. 

p [0050] The antagonist may be an antisense molecule or sense molecule 
*M (i.e. for co-suppression), a ribozyme, a DNA or RNA binding molecule (e.g. 
fS peptide, polypeptide or protein) which prevents or reduces expression of the 
^ 15 target gene, an antibody or other molecule capable of interacting with the 
^ expression product. The antagonist may alternately reduce promoter activity 

; ' and/or 5* and/or 3' untranslated regulatory regions. 

□ 

III 

ijf [0051] One particularly useful group of antagonists are those identified 
w 20 following natural product screening or bioprospecting of sources such as a 
coral, plants, terrestrial environments, aquatic environments, micro- 
organisms and higher organisms. 

[0052] The present invention further contemplates a composition such 
25 as a pharmaceutical composition comprising the modulator (eg. antagonist) 
and one or more pharmaceutically acceptable carriers and/or diluents. 

[0053] Pharmaceutically acceptable carriers and/ or diluents include 
any and all solvents, dispersion media, coatings, antibacterial and antifungal 
30 agents, isotonic and absorption delaying agents and the like. The use of such 
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media and agents for pharmaceutical^ active substances is well known in 
the art. Except insofar as any conventional media or agent is incompatible 
with the active ingredient, use thereof in the therapeutic compositions is 
contemplated. Supplementary active ingredients can also be incorporated 
5 into the compositions. 

[0054] The pharmaceutical composition may also comprise genetic 
molecules such as a vector capable of transfecting target cells where the 
vector carries a nucleic acid molecule capable of modulating expression of a 
10 nucleic acid molecule encoding binding partner. The vector may, for 
example, be a viral vector. In this regard, a range of gene therapies are 
contemplated by the present invention including isolating certain cells, 
genetically manipulating and returning the cell to the same subject or to a 
genetically related or similar subject. 

15 



[0055] Accordingly, the present invention provides a method of treating 
hepatocellular carcinoma, or a related condition, said method comprising 
administering to a subject in need of such treatment an antagonist of a gene 
or gene product which is differentially or preferentially expressed in tissue 
P 20 from subjects with hepatocellular carcinoma or a related condition relative 
to other tissue in said subject and/or subjects not diagnosed with this 
condition. 



[0056] The present invention further provides for a method for 
25 identifying hepatocellular carcinoma or a related condition in a subject or a 
predisposition in a subject for developing such a condition. This aspect of 
the present invention is predicated in part on the identification of the 
expression product which is indicative of hepatocellular carcinoma or a 
predisposition for the development of same. 
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[0057] Still yet another aspect of the present invention is directed to the 
use of a nucleic acid molecule, the expression of which is differential or 
preferential in human hepatocellular carcinoma tissue or tissue from a 
related cancer relative to other tissue in said subject an/ or in subjects not 
5 diagnosed with having this condition in the manufacture of a medicament 
for the treatment of hepatocellular carcinoma or a related condition. 

[0058] The "expression product" may be identified by any number of 
means including the use of antibodies and probes designed to identify mRNA 
10 transcripts. Accordingly, another aspect of the present invention is directed 
to immunointeractive molecules such as antibodies to the expression 
p product and their use in the development of diagnostic assays. 

9 

W [0059] The use of monoclonal antibodies in an immunoassay is 

3rd 1 

,iE 15 particularly preferred because of the ability to produce them in large 
^ quantities and the homogeneity of the product. The preparation of 
v _ hybridoma cell lines for monoclonal antibody production derived by fusing 

fu an immortal cell line and lymphocytes sensitized against the immunogenic 

m 

preparation can be done by techniques which are well known to those who 
O 20 are skilled in the art. (See, for example, Douillard and Hoffman, 1981; Kohler 

Mi 

and Milstein, 1975; 1976). 

[0060] A wide range of immunoassay techniques are available as can be 
seen by reference to U.S. Patent Nos. 4,016,043, 4,424,279 and 4,018,653. 
25 These, of course, includes both single-site and two-site or "sandwich" assays 
of the non-competitive types, as well as in the traditional competitive binding 
assays. These assays also include direct binding of a labelled antibody to a 
target. 

30 [0061] Sandwich assays are among the most useful and commonly 
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used assays and are favoured for use in the present invention. A number of 
variations of the sandwich assay technique exist, and all are intended to be 
encompassed by the present invention. Briefly, in a typical forward assay, an 
unlabelled antibody is immobilized on a solid substrate and the sample to be 
5 tested brought into contact with the bound molecule. After a suitable period 
of incubation, for a period of time sufficient to allow formation of an 
antibody-antigen complex, a second antibody specific to the antigen, labelled 
with a reporter molecule capable of producing a detectable signal is then 
added and incubated, allowing time sufficient for the formation of another 
10 complex of antibody-antigen-labelled antibody. Any unreacted material is 
washed away, and the presence of the antigen is determined by observation 
l«3 of a signal produced by the reporter molecule. The results may either be 
^ qualitative, by simple observation of the visible signal, or may be quantitative 

"t: 

(B by comparing with a control ample containing known amounts of hapten. 

iB 

e< 15 

85=31 

^ [0062] Variations on the forward assay include a simultaneous assay, 

*i in which both sample and labelled antibody are added simultaneously to the 

m bound antibody. These techniques are well known to those skilled in the art, 

;jf including any minor variations as will be readily apparent. In accordance 

O 20 with the present invention, the sample is one which might contain an 

esse 

expression product such as a peptide, polypeptide or protein including 
mammalian cell extract, tissue biopsy, culture supernatant fluid or microbial 
or other cell extract. The sample is, therefore, generally a biological sample 
comprising biological fluid, and, as stated above, also extends to 
25 fermentation fluid and supernatant fluid such as from a cell culture. 

[0063] In a typical forward sandwich assay, a first antibody having 
specificity for the expression product or antigenic parts thereof, is either 
covalently or passively bound to a solid surface. The solid surface is typically 
30 glass or a polymer, the most commonly used polymers being cellulose, 
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poly aery lamide, nylon, polystyrene, polyvinyl chloride or polypropylene. The 
solid supports may be in the form of tubes, beads, discs of microplates, or 
any other surface suitable for conducting an immunoassay. 

5 [0064] The binding processes are well-known in the art and generally 
consist i©f-ctQSs-linking covalently binding or physically adsorbing, the 
polymer-antibodycdmglex is washed in preparation for the test sample. An 
aliquot of the sample to befest^d is then added to the solid phase complex 
and incubated for a period of time^strfjicient (e.g. 2-40 minutes or overnight 
1(N if more convenient) and under suitable conditions (e.g. from room 
> ( temperature to 25DC or above) to allow binding or&ny^subunit present in the 
antibody. Following the incubation period, the antibody subunit solid phase 
is washed and dried and incubated with a second antibody specifrsvfor a 
portion of the hapten. The second antibody is linked to a reporter molecule 
15 which is used to indicate the binding of the second antibody to the hapten. 

[0065] An alternative method involves immobilizing the target 
molecules in the biological sample and then exposing the immobilized target 
to specific antibody which may or may not be labelled with a reporter 
20 molecule. Depending on the amount of target and the strength of the 

reporter molecule signal, a bound target may be detectable by direct labelling 
with the antibody. 

[0066] Alternatively, a second labelled antibody, specific to the first 
25 antibody is exposed to the target-first antibody complex to form a target-first 
antibody- second antibody tertiary complex. The complex is detected by the 
signal emitted by the reporter molecule. 

[0067] By "reporter molecule", as used in the present specification, is 
30 meant a molecule which, by its chemical nature, provides an analytically 
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identifiable signal which allows the detection of antigen-bound antibody. 
Detection may be either qualitative or quantitative. The most commonly used 
reporter molecules in this type of assay are either enzymes, fluorophores or 
radionuclide containing molecules (i.e. radioisotopes) and chemiluminescent 
5 molecules. 



[0068] In the case of an enzyme immunoassay, an enzyme is 
conjugated to the second antibody, generally by means of glutaraldehyde or 
periodate. As will be readily recognized, however, a wide variety of different 
10 conjugation techniques exist, which are readily available to the skilled 

artisan. Commonly used enzymes include horseradish peroxidase, glucose 
oxidase, p-galactosidase and alkaline phosphatase, amongst others. The 
substrates to be used with the specific enzymes are generally chosen for the 
production, upon hydrolysis by the corresponding enzyme, of a detectable 



IB 

;p 15 color change. Examples of suitable enzymes include alkaline phosphatase 
and peroxidase. It is also possible to employ fluorogenic substrates, which 
yield a fluorescent product rather than the chromogenic substrates noted 
above. 



! 3 

■: est' 



Mi 



M 20 [0069] In all cases, the enzyme-labelled antibody is added to the first 
antibody hapten complex, allowed to bind, and then the excess reagent is 
washed away. A solution containing the appropriate substrate is then added 
to the complex of antibody-antigen-antibody. The substrate will react with 
the enzyme linked to the second antibody, giving a qualitative visual signal, 
25 which may be further quantitated, usually spectrophotometrically, to give an 
indication of the amount of hapten which was present in the sample. 
"Reporter molecule" also extends to use of cell agglutination or inhibition of 
agglutination such as red blood cells on latex beads, and the like. 



30 [0070] Alternately, fluorescent compounds, such as fluorecein and 
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rhodamine, may be chemically coupled to antibodies without altering their 
binding capacity. When activated by illumination with light of a particular 
wavelength, the fluorochrome-labelled antibody adsorbs the light energy, 
inducing a state to excitability in the molecule, followed by emission of the 
5 light at a characteristic color visually detectable with a light microscope. As 
in the EIA, the fluorescent labelled antibody is allowed to bind to the first 
antibody-hapten complex. After washing off the unbound reagent, the 
remaining tertiary complex is then exposed to the light of the appropriate 
wavelength, the fluorescence observed indicates the presence of the hapten 
10 of interest. Immunofluorescene and EIA techniques are both very well 

established in the art and are particularly preferred for the present method. 

p However, other reporter molecules, such as radioisotope, chemiluminescent, 

'J% or bioluminescent molecules, may also be employed. 

15 [0071] As stated above, when the expression product is mRNA, nucleic 
acid probes may be employed to detect the presence of the mRNA 

E; transcripts. A Northern blot is one example of detecting the presence of the 

O 

fy transcripts. PCR and solid phase detection systems may also be used. 

Q 20 [00721 The detection of the expression product according to the present 
invention is conveniently provided in kit form with compartments adapted to 
contain the reagents for conducting the assay. Such reagents include 
antibodies, nucleic acid probes, PCR primers, enzymes, and/or diluents 
amongst other compounds. 

25 

[0073] The present invention further provides for the use of a nucleic 
acid molecule the expression of which is differential or preferential in human 
hepatocellular carcinoma tissue relative to other tissue or tissue from 
subjects not diagnosed with having this condition in the manufacture of a 
30 medicament for the treatment of hepatocellular carcinoma or a related 
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condition. 

[0074] The present invention is described hereinafter with reference to 
the detection of one particular gene designated hcc- 1 from the human 
5 hepatocellular carcinoma cell line, HCC-M. The nucleotide sequence of hcc- 
1 is provided in SEQ ID NO:l. The corresponding expression product is a 
protein designated HCC- 1 and this comprises an amino acid as set forth in 
SEQ ID NO:2. A PCR extended form for use in a vector is shown in SEQ ID 
N0:3. 

10 

[0075] Reference herein to "hcc-l" includes reference to its derivatives 

□ and homologues, "derivative" and a homologue w being as hereinbefore defined. 

if* 

jl Likewise, reference herein to the a HCC-l* polypeptide includes reference to 

® all derivatives, homologues, and analogues thereof. 

^ [0076] Another aspect of the present invention provides an isolated 
^ nucleic acid molecule comprising a sequence of nucleotides substantially as 

Swap 

tti set forth in SEQ ID NO: 1 or a sequence having at least 60% similarity 

ili 

jj: thereto after optimal alignment or a sequence capable of hybridizing to SEQ 
O 20 ID NO: 1 or its complementary form under low stringency conditions and 
wherein the expression of said nucleotide sequence is differential or 
preferential in human hepatocellular carcinoma tissue relative to other 
tissue or tissue from subjects not diagnosed with this condition or a 
derivative or homologue of said nucleic acid molecule, "derivative" and 
25 'homologue" being as hereinbefore defined. 

[0077] Another aspect of the present invention provides an isolated 
polypeptide comprising an amino acid sequence substantially as set forth in 
SEQ ID NO: 2 or an amino acid sequence having at least 60% similarity 
30 thereto or an amino acid sequence encoded by SEQ ID NO: 1 or a nucleotide 
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sequence having at least 60% similarity to SEQ ID NO: 1 after optimal 
alignment or a nucleotide sequence capable of hybridizing to SEQ ID NO: 1 
under low stringency conditions or a derivative, homologue or analogue of 
the polypeptide. 

5 

[0078] The term "similarity" as used herein includes exact identity 
between compared sequences at the nucleotide or amino acid level. Where 
there is non-identity at the nucleotide level, "similarity" includes differences 
between sequences which result in different amino acids that are 
10 nevertheless related to each other at the structural, functional, biochemical 
and/ or conformational levels. Where there is non-identity at the amino acid 
p level, "similarity" includes amino acids that are nevertheless related to each 
I* other at the structural, functional, biochemical and/ or conformational levels, 
ro In a particularly preferred embodiment, nucleotide and sequence 

m 

s g 15 comparisons are made at the level of identity rather than similarity. 

i." 3 

'Mi 

in 

J = [0079] Terms used to describe sequence relationships between two or 

m more polynucleotides or polypeptides include "reference sequence", 

m 

; 7 "comparison window", "sequence similarity", "sequence identity", "percentage 
O 20 of sequence similarity", "percentage of sequence identity", "substantially 

similar" and "substantial identity". A "reference sequence" is at least 12 but 
frequently 15 to 18 and often at least 25 or above, such as 30 monomer 
units, inclusive of nucleotides and amino acid residues, in length. 



25 [0080] Because two polynucleotides may each comprise (1) a sequence 
(i.e. only a portion of the complete polynucleotide sequence) that is similar 
between the two polynucleotides, and (2) a sequence that is divergent 
between the two polynucleotides, sequence comparisons between two (or 
more) polynucleotides are typically performed by comparing sequences of the 

30 two polynucleo tides over a "comparison window" to identify and compare 
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local regions of sequence similarity. 



[0081] 



A "comparison window" refers to a conceptual segment of 



typically 12 contiguous residues that is compared to a reference sequence. 
5 The comparison window may comprise additions or deletions (i.e. gaps) of 
about 20% or less as compared to the reference sequence (which does not 
comprise additions or deletions) for optimal alignment of the two sequences 
Optimal alignment of sequences for aligning a comparison window may be 
conducted by computerised implementations of algorithms (GAP, BESTFIT, 
10 FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 
7.0, Genetics Computer Group, 575 Science Drive Madison, WI, USA) or by 
O inspection and the best alignment (i.e. resulting in the highest percentage 

Ifj homology over the comparison window) generated by any of the various 
;J} methods selected. Reference also may be made to the BLAST family of 

8 p 15 programs as, for example, disclosed by Altschul et al. (1997). A detailed 
m discussion of sequence analysis can be found in Unit 19.3 of Ausubel et al. 

!U (1998). 



20 herein refers to the extent that sequences are identical or functionally or 

structurally similar on a nucleotide-by-nucleotide basis or an amino acid-by- 
amino acid basis over a window of comparison. Thus, a "percentage of 
sequence identity", for example, is calculated by comparing two optimally 
aligned sequences over the window of comparison, determining the number 

25 of positions at which the identical nucleic acid base (e.g. A, T, C, G, I) or the 
identical amino acid residue (e.g. Ala, Pro, Ser, Thr, Gly, Val, Leu, He, Phe, 
Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gin, Cys and Met) occurs in both 
sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in the window of 

30 comparison (i.e., the window size), and multiplying the result by 100 to yield 



[0082] 



The terms "sequence similarity* and "sequence identity" as used 
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the percentage of sequence identity. 

[0083] For the purposes of the present invention, "sequence identity* 
will be understood to mean the "match percentage" calculated by the 
5 DNASIS computer program (Version 2.5 for windows; available from Hitachi 
Software engineering Co., Ltd., South San Francisco, California, USA) using 
standard defaults as used in the reference manual accompanying the 
software. Similar comments apply in relation to sequence similarity. 

10 [0084] Reference herein to a low stringency includes and encompasses 
from at least about 0 to at least about 15% v/v formamide and from at least 
Q about 1 M to at least about 2 M salt for hybridization, and at least about 1 M 
^ to at least about 2 M salt for washing conditions. Generally, low stringency is 

at from about 25-30°C to about 42°C. The temperature may be altered and 
,S 15 higher temperatures used to replace formamide and/or to give alternative 
j-pjj stringency conditions. Alternative stringency conditions may be applied 

;L. where necessary, such as medium stringency, which includes and 

\3 

iU encompasses from at least about 16% v/v to at least about 30% v/v 
\2 formamide and from at least about 0.5 M to at least about 0.9 M salt for 
20 hybridization, and at least about 0.5 M to at least about 0.9 M salt for 
washing conditions, or high stringency, which includes and encompasses 
from at least about 31% v/v to at least about 50% v/v formamide and from 
at least about 0.01 M to at least about 0. 15 M salt for hybridization, and at 
least about 0.01 M to at least about 0.15 M salt for washing conditions. In 
25 general, washing is carried out T m = 69.3 + 0.41 (G+C)% (Marmur and Doty, 
1962). However, the T m of a duplex DNA decreases by 1°C with every increase 
of 1% in the number of mismatch base pairs (Bonner and Laskey, 1974). 
Formamide is optional in these hybridization conditions, 

30 [0085] Accordingly, particularly preferred levels of stringency are 
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defined as follows: low stringency is 6 x SSC buffer, 0. 1% w/v SDS at 25- 
42°C; a moderate stringency is 2 x SSC buffer, 0. 1 /o w/v SDS at a 
temperature in the range 20°C to 65°C; high stringency is 0.1 x SSC buffer, 
0. 1% w/v SDS at a temperature of at least 65°C. 

5 

[0086] The present invention further extends in a modified nucleotide 
sequence encoding HCC- 1 where a nucleotide sequence is optimized to 
facilitate greater expression in a particular host cell. Accordingly, the 
present invention contemplates a method for the construction of a nucleic 
10 acid molecule comprising a non-naturally occurring nucleotide sequence, 
said method comprising constructing in a particular reading frame, a 
contiguous sequence of codons which encode a sequence of amino acids of a 

Cl polypeptide where one or more codons are selected to express at a higher 

SI 

( ^ level in a particular host cell or in vitro expression system relative to the 
";H 15 corresponding codons in the naturally occurring nucleotide sequence 
N encoding the same polypeptide, wherein the selected codons are preferably 
used by a host cell, and wherein the codon for Phe may be selected from the 

If group comprising UUU and UUC, the codon for Ser may be selected from the 

i y 

Hi group comprising UCU, UCC, UCA, UCG, AGU and AGC, the codon for Tyr 

q 20 may be selected from the group comprising UAU and UAC, the codon for Cys 
^ may be selected from the group comprising UGU and UGC, the codon for Trp 
may be selected from the group comprising UGG, the codon for Leu may be 
selected from the group comprising CUU, CUC, CUA, CUG, UUA and UUG, 
the codon for Pro may be selected from the group comprising CCU, CCC, 
25 CCA and CCG, the codon for His may be selected from the group comprising 
CAU and CAC, the codon for Gin may be selected from the group comprising 
CAA and CAG, the codon for Arg may be selected from the group comprising 
CGU, CGC, CGA, CGG, AGA and AGG, the codon for He may be selected 
from the group comprising AUU, AUC and AUA, the codon for Met may be 
30 selected from the group comprising AUG and GUG, the codon for Thr may be 
selected from the group comprising ACU, ACC, ACA, and ACG, the codon for 

-25- 



# 4 



Asn may be selected from the group comprising AAU and AAC, the codon for 
Lys may be selected from the group comprising AAA and AAG, the codon for 
Val may be selected from the group comprising GUU and GUC, the codon for 
Ala may be selected from the group comprising GUA, GUG, GCU, and GCC, 
5 the codon for Asp may be selected from the group comprising GCA, GCG, 
GUA and GAC, the codon for Glu may be selected from the group comprising 
GAA and GAG, and the codon for Gly may be selected from the group 
comprising GGU, GGC, GGA, and GGG. 

10 [0087] Reference herein to a "host ceir refers to a cell or cells derived 
such as from a group including but not limited bacteria, yeasts, fungi, 
plants, insects and animals. A host cell is capable of expressing a peptide, 

□ 

Ci polypeptide or protein from a nucleic acid molecule. The term "host ceir 

p may also be read as a "foreign* cell meaning that the host cell is not from the 

® 15 species or strain of organism from which a particular coding sequence or 
%| non-coding sequence is derived. The host cell may however be a genetically 
" modified form of the original source organism. In the case of a coding 

sequence or non-coding sequence derived from P. gingivalis or a related 

llJ 

ry organism, the suitable host cell for expression of a modified sequence 
m 20 includes E. coli stains such as but not limited to, WA803, WA802, RR1, 
^ Q359, Q538, P2392, NM621, NM554, NM477, MC4100, MC1061, DL538, 

DB1316, CSH18, CES200, C600hfi, C600, BNN102, BNN93, BL21(DE3), and 

BHB2690. 

25 [0088] Other suitable bacterial host cells include but are not limited to 
the following bacteria, Aminobacterium mobile DSM 12262, Aminomonas 
paucivorans DSM 12260, Asaia bogorensis J CM 10569, Bacteroides 
thetaiotaomicron BTX, Burkholderia kururiensis J CM 10599, Desulfovibrio 
dechloracetivorans SF3, Escherichia coli HS(pFamp)R, Kocuria rhbzophila DSM 

30 1 1926, Methylobacterium mesophilicum AM24, Mycobacterium avium MAC 
511, Mycobacterium avium MAC 101, Phormidium corium, Pseudomonas 
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aeruginosa ERC1, Pseudomonas aeruginosa HER- 1001, Pseudomonas 
aeruginosa HER- 1002, Pseudomonas aeruginosa HER- 10 10, Pseudomonas 
aeruginosa HER- 1009, Pseudomonas aeruginosa HER- 10 16, Pseudomonas 
aeruginosa HER- 10 17, Pseudoxanthomonas broegbernensis DSM 12573, 
5 Ralstonia gilardii LMG 5886, Shewanella frigidimarina ACAM 591, 
Shewanella gelidimarina ACAM 456, Streptococcus pneumoniae MS22, 
Streptococcus pneumoniae FilO, Streptococcus pneumoniae 51702, 
Streptococcus pneumoniae TW3 1, Streptococcus pneumoniae TW17, 
ThiomicrospirafrisiaJB-A2 t Thiomicrospira kuenenii JB-A1, Treponema 
10 lecithinolyticum OMZ 685, Treponema maltophilum BR, Treponema 
maltophilum PNA1, Treponema maltophilum H02A, and Ureaplasma 
O urealyticum. 



,2, 7A, Kluyveromyces lactis CW64- 1C, Prosthemium asterosporum A 1 , 
r ^ Prosthemium betulinum Bl, Saccharomyces cerevisiae 1A-H19 [psi-], 

fy Saccharomyces cerevisiae 5V-H19 [psi-], Saccharomyces cerevisiae 1-5V-H19, 

\* Saccharomyces cerevisiae PS-5V-H19, Saccharomyces cerevisiae C10B-H49, 
0 20 Saccharomyces cerevisiae 9V-H70 [PIN+], Saccharomyces cerevisiae 4V-H73, 
Saccharomyces cerevisiae 17G-H73, Saccharomyces cerevisiae 3B-H72, 
Saccharomyces cerevisiae DL1, Saccharomyces cerevisiae GW226, 
Saccharomyces cerevisiae JM43-GD7, Saccharomyces cerevisiae MCC318, 
Saccharomyces ceremsiae NB39-5D, Saccharomyces cerevisiae NGB108, 
25 Saccharomyces cerevisiae PTH43, Saccharomyces cerevisiae PTH352, 
Saccharomyces cerevisiae PTY1 1, Saccharomyces cerevisiae TF1 12, 
Saccharomyces cerevisiae TWM 10-41, Saccharomyces kluyveri GRY1175, 
Saccharomyces kluyveri MCC328, and Saccharomyces kluyveri NB180. 

30 [0090] Suitable mammalian host cells for expression include, the 



m [0089] Still other suitable host cells include but are not limited to the 
,C 15 following fungal cells Hyphodontia australis 231, Kluyveromyces lactis CK56- 



-27- 



% % 



mammalian cell lines including but not limited to mammalian cell line, 
22Rvl Human prostate carcinoma, A7 Human melanoma, B 13-24 Chinese 
hamster, antibody producing, EOC 2 Mouse microglia; macrophage, EOC 
13.31 Mouse microglia; macrophage, EOC 20 Mouse microglia; macrophage, 
5 HAAE-2 Human normal abdominal aorta, HS-5 Human HPV-16 E6/E7 
transformed, 1-11.15 Mouse macrophage, 1-13.35 Mouse macrophage, KMA 
Human macrophage; monocyte, NCI-BL 1770 Human Epstein-Barr 
transformed B lymphoblastoid line,NCI-BL2107 Human Epstein-Barr 
transformed B lymphoblastoid line, NCI-BL2141 Human Epstein-Barr 
10 transformed B lymphoblastoid line, NCI-H211 Human carcinoma; small cell 
lung cancer, NCI-H84 1 Human carcinoma; variant small cell lung cancer, 

□ NCI-H847 Human carcinoma; classic small cell lung cancer, NCI-H1341 
Human carcinoma; small cell lung cancer, NCI-H2122 Human 

^ adenocarcinoma; non-small cell lung cancer, RTgill-Wl Rainbow trout, 

,p 15 normal gill, F-l.CN5a.l Human erythroleukemia, TK#1 Mouse disrupted 
interferon regulatory factor 2 (IRF-2) gene, and TOV-1 12D Human primary 

J 1 ^ malignant adenocarcinoma 

jU 

\2 [0091] Reference herein to a nucleic acid molecule or nucleotide 
H 20 sequence being a non-naturally occurring" or other wise "non-natural" is 
meant to be considered in its broadest sense to include a nucleic acid 
molecule or nucleic acid sequence which has been artificially created by a 
chemical synthetic or recombinant means or by directed or controlled genetic 
processes including homologous recombination. The selection of a 
25 particularly preferred codon or nucleotide sequence is deemed here to be an 
example of rendering the resulting nucleic acid molecule or nucleotide 
sequence as non-naturally occurring. 

[0092] Reference herein to an "in vitro expression system" includes an 
30 in vitro translation system and refers to a cytoplasmic or cell extract 
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comprising molecules such that when the cell extract is provided with a 
nucleic acid sequence that encodes a peptide polypeptide or protein, the cell 
extract is competent to express the peptide polypeptide or protein. Such 
extracts may be produced from cells or tissues derived such as from but not 
5 limited to the group including bacteria, yeasts, fungi, plants, insects, and 
animals. 

[0093] The hcc- 1 nucleic acid molecule may be resident in isolated form 
as a linear, single or double stranded molecule or it may be resident in a 
10 vector such as an expression vector. 

□ [0094] The present invention further provides transgenic cells carrying 
;^ hcc-l or otherwise producing HCC-1. Such cells include bacteria, yeast, 

K insect, animal, and mammalian cells. 

m 

'f* 15 

1^ [0095] Yet another aspect of the present invention provides an 

s ^ antisense molecule to hcc-l transcript whereby the antisense molecule 

jy reduces expression of hcc-l by from about 5% to about 100% or from about 
10% to about 80% or from about 20% to about 70% relative to a control. 

M 20 

1*4* 

[0096] The hcc- 1 gene is expressed in hepatocellular carcinoma tissue 
but is substantially not expressed in other tissue. The gene and its 
expression product, HCC-1, provide a convenient marker for the cancer 
condition and /or for the development of antagonists of hcc-l expression or 
25 HCC-1 activity. 

[0097] It is proposed that HCC- 1 is involved in nucleic acid binding and 
transcriptional control. Modulating expression of hcc-l or modulating HCC-1 
activity provides a means of modulating cell regulation. Accordingly, another 
30 aspect of the present invention contemplates a method of modulating one or 
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more activities within a cell, said method comprising modulating expression 
of hcc- 1 gene expression or the activity of HCC- 1 for a time and under 
conditions sufficient to modulate the cell activity. 

5 [0098] Reference to cell activity includes at least one physiological, 

biochemical, immunological, or other biological property within the cell or on 
the cell surface. For example, in so far as HCC-1 is involved in transcription, 
increasing levels of HCC- 1 or decreasing levels of this protein will effect the 
level of transcription of the target gene. 

10 

[0099] The present invention is further described by the following non- 
p limiting Examples. 

g EXAMPLE 1 - Culture techniques 

# 15 

,2 [00100] The HCC-M cell line was cultured in Dulbelcco's modified Eagle 

^ medium (DMEM) from Gibco BRL (Life Technologies, Gaithersburg, MD, USA) 

IU containing 10% v/v fetal calf serum (FCS) from Biological Industries 

ill 

\2 (Haemek, Israel) at 37°C in 5% C0 2 /95% air at 95% relative humidity. The 

p 20 cells were harvested once a monolayer culture was attained. During 

i 

harvesting, the cells were rinsed with DMEM without FCS. Cell 
detachment was performed by incubation with a solution of 0.5 g/L trypsin 
and 0.2 g/L ethylenediaminetetraacetic acid [EDTA] (Gibco BRL). After 15 
mins, DMEM containing FCS was added to terminate the action of the 
25 protease. The resulting suspension was centrifuged at 2000 rpm for 5 mins 
at 4°C. After discarding the supernatant fluid, the cells were resuspended 
with DMEM without FCS and centrifuged at 10000 rpm for 5 mins at 4°C. 
After centrifugation, the supernatant was removed and the cell pellet stored 
at -80°C until further use. 

30 
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EXAMPLE 2 - Sample preparation 

1001011 Harvested HCC-M cells were disrupted with a cocktail of 7 M 
urea (Bio-Rad Laboratories, Hercules, CA, USA). 2 M thiourea (Fluke Chemie 
AG Buchs, Switzerland), 4% v/v 3.[(3-cholamido P ropyl)dimethylammomo]- 
1-propanesulphonate (CHAPS) (USB, Amersham Pharmacia Biotech AB, 
Uppsala, Sweden), 40 mM tris(hydroxymethyl)aminomethane (Tns) (J. T. 
Baker, Phillipsburg, NJ, USA) and 1 mM phenylmethylsulphonyl fluonde 
(PMSF) (Sigma Chemical Co., St. Louis, MO, USA). The resulting cell lysate 
was subjected to physical shearing by passing it through a syringe fitted 
with a 21G needle, followed by syringes with 25G and 27G needles 
successively, and the addition of 50 ug/ml DNase I (from bovine pancreas 
grade II, Boehringer Mannheim, GmbH, Mannheim, Germany) and 50 ug/ml 
RNase A (from bovine pancreas, Boehringer Mannheim). The sample was 
then centrifuged using a Beckman TL-100 Tabletop Ultracentrifuge (Palo 
Alto, CA, USA) at 85000 rpm (297785 x g) for 2 hrs at 15<>C. 

EXAMPLE 3 - Two-dimensional gel electrophoresis 

[00102] The first dimensional IEF was performed on precast 18 cm IPG 
strips (Amersham Pharmacia Biotech) at 20DC with a maximum current 
setting of 50 uA/strip using an Amersham Pharmacia IPGphor IEF unit. The 
strips were rehydrated for a minimum of 10 hrs in ceramic strip holders in 
350 DL of sample containing 7 M urea, 2 M thiourea, v/v CHAPS, 1 mM 
PMSF, 20 mM dithiothreitol (DTT) (Bio-Rad) and 0.5% v/v IPG buffer 
(Amersham Pharmacia Biotech). The amount of protein loaded was -150 ug 
for analytical gels and -400 ug protein for preparative gels. A low voltage of 
30 V was applied during rehydration. After rehydration, IEF run was carried 
out using the following conditions: (i) 500 V, 500 Vhr; (ii) 1,000 V, 1000 Vhr; 
and (iii) 8000 V, 32000 Vhr. Voltage increases were performed on a step- 
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wise basis. Before carrying out the second-dimensional sodium dodecyl 
sulphate-polyacrylamide gel electrophoresis (SDS-PAGE), the strips were 
subjected to a two-step equilibration. The first was an equilibration buffer 
consisting of 6 M urea, 30% v/v glycerol (BDH Laboratory Supplies, Poole, 
England), 2% w/v SDS (Merck KGaA, Darmstadt, Germany), 50 mM Tris-HCl 
(p H 6 8) and 1% w/v DTT. The second step was with a buffer consisting of 6 
M urea, 30% v/v glycerol, 2% w/v SDS, 50 mM Tris-HCl (pH 8.8) and 2.5% 
w/v iodoacetamide (IAA) (Sigma). After the IPG strips were transferred onto 
the second-dimension SDS-PAGE gel, the strips were sealed in place with 
0 75% agarose (USB). SDS-PAGE was performed on 1.0 mm thick 10% and 
10% w/v polyacrylamide gels at a constant voltage of 1 10 V at 10°C using an 
Amersham Pharmacia Iso-Dalt electrophoresis unit. 



EXAMPLE 4 - Silver staining 



[00103] Silver staining of the gels was performed using published 
procedures with some modifications. The gels were fixed in 50% v/v 
methanol (Merck), 5% v/v acetic acid (Merck) in water for 30 mins followed 
by washing in 50% methanol in water for 10 mins. Then the gels were 
washed again with water for 60 mins and sensitized with 0.02% sodium 
thiosulphate (Merck) for 2 mins. After the gels were rinsed twice with water 
for 1 min each, they were incubated in chilled 0.1% w/v silver nitrate 
(Merck) for 40 mins at 4°C. After discarding the silver nitrate and rinsing 
with two changes of distilled water for 1 min each, the gels were developed in 
0.04% v/v formalin (35% v/v formaldehyde in water) (Merck) in 2% w/v 
sodium carbonate (Merck). When the desired intensity was attained, the 
developer was discarded and the gel incubated with 1.46% w/v EDTA 
disodium dihydrate (Bio-Rad) for 10 mins to stop the development. The 
staining procedure was completed by three rinses with water for 5 mins 
each Stained gels were scanned using a Molecular Dynamics Personal 
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Densitometer SI (Sunnyvale, CA, USA). 
EXAMPLE 5 - Image analysis 

[00104] The gels were analyzed by traditional eyeballing method and the 
PDQuest (Version 6.1) software from Bio-Rad Laboratories. Using the Spot 
Detection Wizard function, the scanned gels were processed to remove 
vertical and horizontal streaks and enhance the spots before crosshairs were 
placed on the detected spots. 

EXAMPLE 6 - Enzymatic digestion of protein spots 



[00105] Silver stained spots were excised manually with a homemade 
plasticNplunger and transferred to a 96-well polypropylene microtitre plate. 
15 Each excised spots was washed with 175 4>L of 25 mM Tris-HCl (pH 8.5) in 
50% acetonitnteJApplied Biosystems, Foster City, CA, USA). The plate was 
sealed with an adhesive film and stored at 4°C for at least 24 hrs. This step 
was critical for the equilibration of gel spots as it allowed for more efficient 
enzyme digestion. Prior to th^addition of trypsin, the washing solution was 
20 replaced with a fresh aliquot of solution and plates were incubated with 
shaking for 20 mins at 37°C. The waslimg solution was then aspirated and 
gel spots were dried in a Savant AutomaticN^nvironment SpeedVac AES2010 
centrifugal concentrator (Holbrook, NY, USA) fbf^30 mins. Enzymatic 
digestion was performed with the addition of 10 4>I>Qf 0.02 ug/°L trypsin 
25 (Promega Corporation, Madison, WI, USA) in 25 mM ammonium bicarbonate 
(pH 8.5) (Sigma) to each gel piece and incubated at 37 DC overnight with 
shaking. To enhance peptide extraction, 10 <£L of 0. 1% trifluohjacetic acid 
(TFA) (Sigma) in 50% acetonitrile was added to each well and the^mcrotitre 
plate sonicated for 10 mins in an ultrasonic water bath (Crest Ultrasonics, 
30 NJ, USA). 
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EXAMPLE 7 - Matrix-assisted laser desorption/ionization - Time of Flight 
(MALDI-TOF)-MS analysis of tryptic peptides 

[<Xm>6] Mass analyses were performed according to a previously 
published methods using a PerSeptive Biosystems Voyager-DE STR MALDI 
TOP MSWingham, MA, USA). In essence, 1 OL of the extracted sample 
from each oW microtitre wells was dispensed onto a MALDI sample plate 
along w iUl , D ^ atrix mautka (1Q mg/mLa-cyano-4-hydroxycinnamic 
aad (Slgma) , o.l»/^, 50% acetonjtrile) The samp]es ^ ^ 

under ambient conditions. Por each sample, the average of 256 spectra was 
acquired in the delayed extraction and reflector mode. The average of 4 scans 
(each containing 64 spectral passed the accepted criterion of peak 
mtensiiy was automatically seized and saved. Spectra were automatically 
ISScahbrated upon acquisition usingWpoint calibration with residual 
' /porcine bypsin autolytic fragments (84^51 and 2210.10 [M+H-J ions) 
, Ass lg nment of peaks was done manually\easured peptide masses were 
; XClUded * ,hdr — responded to tryV, autodigestion products or 
from ,dentified proteins adjacent to the spot be«\analyzed. 
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EXAMPLE 8 - Quadrupole-Time-Of-Flight (TOP) TandemNMS analysis of 
tryptic peptides 

[00107] De novo peptide sequencing was performed using a PE Sciex 
QSTAR9 tandem mass spectrometry system (Concord, Ontario, Canada) 
The tryptic digested protein samp.e cleanup was conducted using toe C„ Zip 
T.P (Muhpore) and eluted with 3 pi of 60% v/v methanol/5% v/v formic acid 
One pi sample was loaded onto the spray needle for nanospray (Protana 
DK) analysis. The spray was started by applying a spray potential of 800 
volts. The spray lasted for about 25 mins for each sample. QSTAR was 
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operated with resolution of about 10,000 FWHM. Data acquisition were 
done using TOF Tune software and data were processed using Biomultiview 
software. The «y and «b» ions weightage were used to get the sequence from 
MS/MS of peptides. 

EXAMPLE 9 - Database searching and identification of proteins 

[00108] The proteins were identified by searching in SWISS-PROT and 
NCBI non-redundant databases using MS-Fit (Protein Prospector, UCSF, San 
Francisco, USA). All mass searches were performed using a mass window 
between 1000 and 10000 Da, and included human and mouse sequences. 
The search parameters allowed for oxidation of methionine, N-terminal 
acetylation, carboxyamidomethylation of cysteine and phosphorylation of 
serine, threonine and tyrosine. The criteria for positive identification of 
proteins were set as follows: (i) at least four matching peptide masses; (ii) 
50 ppm or better mass accuracy; and (iii) identified proteinsD molecular 
weight and pi should match estimated values obtained from image analysis. 

EXAMPLE 10 - Identification of hcc-1 

[00109] Proteins from complex cell lysates were obtained from tissue 
samples or cell lines and separated using two-dimensional SDS-PAGE. The 
separated proteins were then excised from the gel and subjected to an in-gel 
enzymatic digest. The resultant peptides were then analyzed using a MALDI- 
TOF MS and a Quadrupole-TOF Tandem MS. Database searches were 
performed with the mass spectrometric data obtained. 



[001 10] The present invention arose initially using the MS/MS data 
obtained from the Quadrupole-TOF Tandem MS. The sequences of four 
peptide fragments were identified by this method. These data were used to 
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search the protein databases and no matches with any known proteins were 
found. 

[001 1 1] The HCC-M cells were grown to confluence and RNA of the cells 
were extracted through a standard guanidine isothiocyanate method 
(Chomczynski and Sacchi, 1987). Poly-A RNA was then purified from the 
RNA through poly-T resin binding. DNA primers were made based on the 
peptide sequences and a rapid amplification of cDNA ends (RACE) was 
performed on the poly-A RNA. The 5'-RACE and 3'-RACE results were 
compared and stitched together to give a full-length gene of 894 bases 
(Figure 1; hcc-1; SEQ ID NO:l). The 3'-RACE product contained the poly-A 
tail of the gene. 

[001 12] The open reading frame (ORF) of this novel gene was determined 
from the various possible ORFs to contain a protein of 210 amino acids in 
length (Figure 2; HCC-1; SEQ ID NO:2). The novel protein has a theoretical 
pi and molecular weight of 6.1 and 23.6 kD, respectively. DNA primers at the 
extreme ends of this novel gene (the sense primer were situated before the 
start codon at the 5' end and the anti-sense primer before the poly-A tail at 
the 3' end) were synthesized and a long distant polymerase chain reaction 
(PCR) was then performed on the HCC-M poly-A RNA. 

[001 13] The product (873 bp, Figure 3; SEQ ID NO:3) was TA cloned into 
pGEM-T (Promega, Inc., USA) and subsequently transferred to an expression 
vector pQE-30 (Qiagen, Germany). Host for the above two vectors was 
Escherichia coli strain DH5a. The expression product contained a 6 x His tag 
at the amino end of the protein. 

[00114] A multiple tissue panel containing 1st strand cDNA from both 
human normal and tumor tissues was obtained from Clontech (USA). Highly 
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specific primers (T m -70DC) were generated based on the novel gene sequence 
and used to perform a PCR screening on the multiple tissue panel. Results 
are as shown in Table 2. Human healthy liver tissue (obtained during liver 
transplant operation) and a commercial human normal liver cDNA library 
(Gibco BRL, USA) were also found to express this gene at low abundance. 



TABLE 2 Multiple tissue panel* screening for the novel gene 



Bill 



iiiliit'si 



Colon 



Breast carcinoma (GI-101) 



+/- 



Ovary 



Lung carcinoma (LX-1) 



Peripheral blood 
leukocyte 



Colon adenocarcinoma (CX-1) 



Prostate 



+/- 



Lung (GI-117) 



Small intestine 



Prostatic adenocarcinoma 
(PC3) 



Spleen 



Colon adenocarcinoma 
(GI-112) 



Testis 



++ 



Ovarian carcinoma (GI-102) 



Thymus 



Pancreatic adenocarcinoma 
(GI-103) 



+++ 



Brain 



Heart 



Kidney 



Liver 



+/■ 



Lung 



Pancreas 



Placenta 



1st stand cDNA 

Tumor tissues were propagated as xenograft in athymic nude mice. 



EXAMPLE 11 - Comparison of Expression of hcc-1 in normal versus tumor 
liver tissues 
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[00115] Figure 4 shows that the hccl-1 gene is preferentially transcribed 
in the liver of a subject with hepatocellular carcinoma but is substantially 
not transcribed in the normal liver. In this experiment, an equal amount of 
cDNA from a normal liver tissue and from tissue of a subject diagnosed with 
5 hepatocellular carcinoma were amplified by PCR at high stringency, hcc- 1 in 
tumor tissue is found to be about 10 to 50 times the amount in normal liver 
tissue (Figure 4). 



EXAMPLE 12 - Chromosome Localization 

10 

[001 16] The chromosomal location of the Hcc- 1 was identified by 
radiation hybrid mapping of the human genome (Barrett 1997). Two human 
*Q radiation hybrid-mapping panels were used for this purpose. The 

j5 Genebridge4 panel is adopted by the European Consortium on Radiation 

^ 15 Hybrid Mapping and is widely used in genome mapping projects, while the 
H Stanford G3 panel is created at the Stanford Human Genome Centre for 

m 

medium resolution chromosome localization of markers. Briefly, DNA from 
;ff each of the 93 cell lines from Genebridge 4 and 83 cell lines from Stanford 
SU G3 were used as PCR template for primers designed from the 3'-untanslated 

q 20 region of Hcc-L The results were scored for the presence and absence of a 
!s * PCR product from Hcc-L These data were then submitted to 

Whitehead/ MIT RH server (for Genebridge 4) and Stanford Human Genome 
Center (for Stanford G3) where it was tested against the framework markers 
that have already been assayed. The placement of the gene was the best 
25 possible placement when scored against the framework markers at the time 
of experiment. Hcc-1 is assigned to chromosome 7 at position 7q22.1, 3.36 
cR from marker D75651. 



30 



EXAMPLE 13 - Sub-cellular Localization 

[00117] Antibody against Hcc-1 protein was raised in rabbits. Hcc-1 
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protein sub-cellular localization was performed on Huh7 and HCC-M cells by 
immunofluorescent staining. The cells were grown on glass cover slip, fixed 
with paraformaldehyde and detected with antibody against Hcc-L Co- 
localization was performed with antibodies against mitochondria and golgi 
5 body (Lab Vision). The images from the individual antibody staining were 
scanned by confocal microscope and overlaid to form a composite image. 
Hcc-1 protein was localized to the nucleus. 

EXAMPLE 14 - Immunological Studies 

10 

[00118] Antibodies against cloned Hcc-1 protein was raised in rabbits. 
3 Its sensitivity and specificity were verified by Western blots detection of HCC- 
y M lysate in 2D gel. However, Hcc-1 protein was not detectable in Western 

blots of 2D gel electrophoresis or ID SDS-PAGE of human liver tissues. Hcc- 
p 15 1 cDNA expression levels in two paired (non-tumor and hepatocellular 
Jjj carcinoma) human liver tissues are as followed. Both subjects were positive 

L for hepatitis B virus infection. Subject A had well differentiated tumor while 

□ 

U subject B has poorly differentiated tumor. 

H-Ji 

=3 20 TABLE 3: Expression of cDNA in Tumor 



and Non-Tumor Tissue 



Subject 


Liver Tissue 


cDNA 


A 


Non-tumor 






Tumor 


+++ 


B 


Non-tumor 


++ 




Tumor 


+ 



[00119] From the above studies, it can be seen that Hcc-1 is 
differentially expressed. Its cDNA levels were raised in pancreatic 
25 adenocarcinoma as compared to healthy pancreas (see Table 3). It is also 
increased in well-differentiated hepatocellular carcinoma and its level 
seemed to decrease as the tumor progressed to poorly differentiated 
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hepatocellular carcinoma. The pancreas and liver have the same 
developmental origin (Bock et al. 1997) and Hcc-1 is increased in both types 
of tumor. 

5 EXAMPLE 15 - Promoter Study for P-151 

[00120] Four libraries of uncloned, adaptor- ligated high quality human 
genomic DNA fragments were obtained from Clontech, Inc (USA). Nested PCR 
was performed with primers derived from the adaptors and known Hcc- 1 
10 gene sequence at the S'-untranslated region and exon 1 sequence. Two of the 
libraries were amplifiable (with DNA product of 690 bp and 3.8 kb 
respectively). The PCR products were TA cloned and sequenced. The DNA 
*fl sequence for the 690 bp fragment is shown in Figure 5. Multiple mini- 

iji cistrones were noted from nucleotide sequence 300 to 690 bp. 

HI [00121] The 690 bp fragment was then ligated to a vector lacking 

I eukaryotic promoter and enhancer sequences (pSEAP2 from Clontech, Inc). 

y The vector contains a secreted human placental alkaline phosphatase gene 

i y 

fJ (SEAP) downstream of the multiple cloning sites. The construct (5 |ig) were 

O 20 transfected by a liposome-based transfection reagent (Clontech, Inc) into 
r " mammalian Huh7 cells. Normalization was performed by co-transformation 

with a vector containing the lacZ gene. 

[00122] Promoter activity was determined by assaying for the secreted 
25 alkaline phosphatase activity 48 hours post-transfection using the 

fluorescent substrate 4-methylumbelliferyl phosphate (MUP). Low promoter 
activity was observed (10 ng SEAP expressed per 5 jig DNA). When the SV40 
early promoter was added to the vector, increased SEAP transcription was 
observed (90 ng SEAP expressed). However, high transcription activity was 
30 obtained when the 690 bp fragment was constructed into a vector containing 
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SV40 early enhancer sequence (190 ng SEAP expressed). This indicates that 
an enhancer element is needed for the transcriptional activity of the Hcc- 1 
promoter. 

5 [00123] To bypass the mini-cistrones, 274 bp from the 5' end of the 690 

bp fragment was amplified and inserted into the pSEAP2 vector. No activity 

was observed when the pSEAP2 vector was constructed without SV40 early 

enhancer or promoter sequences. Transcriptional activity was observed at 

half (1 10 ng of SEAP expressed) of that from 690 bp fragment when the SV40 

10 early enhancer sequence was included in the construct. The results showed 

that the promoter region is located primarily at the middle of the identified 

Q S'-unstranslated region of the Hcc-1 gene. The enhancer sequence is 

!jj probably further upstream from the 690 bp sequence, 

m 



£ 15 [00124] Promoter region was predicted from 294 to 544 bp by ProScan 

''■--J 

ef{ (ver 1.7). This is in accordance with the promoter studies above where the 

S L 274 bp fragment at the 5' end has less transcriptional activity compared to 

P 

1U the complete 690 bp fragment. 

y* 

H 20 [00125] The occurrence of a long 5' untranslated region with mini- 

fsss 

cistrones or upstream open reading frames (uORFs) is not uncommon. It is 
found in a number of proto-oncogenes and growth factors (Willis 1999). It is 
a structure used in transcriptional regulation and translational control 
(Brown & Schreiber 1996; Clemens & Bommer 1999) of genes whose 
25 products are important for cell growth. 

EXAMPLE 16 - Bioinformatics Findings on Hcc-1 



[00126] Tlre^eR^eDiedDomain Database (CDD) with Reverse Position 
30 Specific BLAST search on the l-4^aiTiine-acids of Hcc-1 gave the result as a 
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SAP domain (e-value of 5e-04), which is a putative bi-helical DNA-binding motif 
predicted to be involved in chromosomal organization and transcriptional 
regulations (Massari & Murre 2000) found in diverse nuclear proteins. This is 
supported qy PredictProtein where amino acid sequence 197-203 was predicted 
to contain tfre nuclear localization signal. There is no predicted trans- 
membrane segment (using TMAP and PredictProtein), no mitochondrial 
targeting sequente (PSORT), and no secretory signal (SignalP). 



10 



15 




[00 127] Using PSl-BLAST on non-redundant database, amino acid sequence 
1-42 of Hcc-1 wa\ matched to vertebrate heterogenous nuclear 
ribonucleoprotein with identities match of above 45%: 

• Heterogenous nucleakribonucleoprotein U (AF073992) of Mus musculus 
[Expect = 0.005, Identifies = 21/42 (50%), Positives = 29/42 (69%)] 

• SP120 (D 14048) (nuclear scaffold protein that binds the matrix 
attachment region DNA) of Ifattus norvegicus 

[Expect = 0.005, Identities = 2V 42 (50%), Positives = 29/42 (69%)] 

• ROU_HUMAN Heterogenous nuclear ribonucleoprotein U (HNRNP U) 
(Scaffold Attachment Factor A) (SAF-A) (Q00839) of Homo sapiens 
[Expect = 0.012 Identities = 20/42 (4^%), Positives = 29/42 (68%)] 
hnRNP U protein (X65488) of Homo sapiens 

[Expect = 0.012, Identities = 20/42 (47%K Positives = 29/42 (68%)] 
Scaffold attachment factor A (AF068847) orXenopus laevis 
[Expect = 0.021, Identities = 20/37 (54%), Positives = 26/37 (70%)] 



25 [00128] 
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Using FASTA3 on SWALL non-redundant N^atabase, Hcc-1 was 
matched to various invertebrate translated proteins with E-value below 0.03: 



• Q9VHC8 CG8149 protein of Drosophila melanogaster^ 
[Expect=8e-06] 

• Q9N3G0 Hypothetical protein Y53G8AR.d of Caenort 
[Expect=0.0005] 



iditis elegans 
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• Q9LZ08 Hypothetical 22.8 KDA protein of Arabidopsis thaliana 
[Expert=0.021] 

• 07487 l^pnserved hypothetical protein of Schizosaccharomyces pombe 
(Fission yea£ 
[Expect=0.024] 

[00129] Physically, this^cc- 1 protein may have 2 to 3 domains from coiled- 
coil and low complexity region predictions: 

• PredictProtein Coiled-Coil prediction - the coil is most probably at 30-5 1 
positions. The next possible coifed-coil is at 146-160 positions. Coiled-coil 
most probably separates the different domains. 
COILS ver 2.2 (Lupas) - at aa 25 - 64\nd aa 145 - 172. 
SEG Low Complexity regions predicted \regions: a t aa 42-79 and aa 
165-179. 



[00130] It is to be understood that the foregoing description and specific 
embodiments shown herein are merely illustrative of the invention and its 
principles. Modifications and additions to the invention may readily be made 
by those skilled in the art without departing from the spirit and scope of this 
invention. 
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[00131] 



The articles in scientific periodicals and any patent 



literature cited hereinabove are hereby expressly incorporated by reference 
in their entireties for all purposes. 



Altschul et al., Nucl. Acids Res. 25: 3389. 1997. 

Ausubel et al., "Current Protocols in Molecular Biology" John Wiley 8b Sons Inc, 
10 1994-1998, Chapter 15. 

Barrett JH 1992. Genetic mapping based on radiation hybrid data. Genomics. 
13: 95-103. 



,p 15 Bock P, Abdel-Moneim M, Egerbacher M. Development of Pancreas. 1997. 
^ Microscopy Research & Technique. 37: 374-383. 



20 Brown EJ, Schreiber SL. 1996. A signaling pathway to translational control. 
Cell. 86: 517-520. 

Chomczynski, P. and Sacchi, N., Anal. Biochem. 162:156169 

25 Clemens MJ, Bommer U-A. 1999. Translational control: the cancer connection. 
The International Journal of Biochemistry and Cell Biology. 31: 1-23. 

Douillard and Hoffman, Basic Facts about Hybridomas, in Compendium of 
Immunology Vol. II, ed. by Schwartz, 1981 

30 



5 



BIBLIOGRAPHY 



Bonner and Laskey, Eur. J. Biochem. 46: 83, 1974. 



-44- 



Kohler and Milstein, Nature 256:495-499, 1975 

Kohler and Milstein, European Journal of Immunology 6:511-519, 1976. 
Marmur and Doty, J. Mol. Biol. 5: 109, 1962. 

Massari ME, Murre C. 2000. Helix-loop-helix proteins: regulators of 
transcription in eukaryotic organisms. Molecular and Cellular Biology. 20: 429- 
440. 

Needleman and Wunsch, J. Mol. Biol. 48: 443-453, 1970 

Sambrook et al (eds). Molecular Cloning. A Laboratory Manual. Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, USA, 1989 

Schafer, D.F. and Sorrell, M.F., Lancet 363:1253-1257, 1999 

Willis AE. 1999. Translational control of growth factors and proto-oncogene 
expression. The International Journal of Biochemistry and Cell Biology. 31: 73- 
86. 



-45- 



